Scorecard construction with unbalanced class sizes
Abstract
A long-running issue in scorecard construction in retail banking is how to handle dramatically unbalanced class sizes, which arise in many applications. We describe the impact that ignoring such imbalance can have and review the various strategies that have been proposed for tackling it, embedding them in a common theoretical framework. We then describe a new 'local' method of scorecard construction which both theory and our experiments show yields superior performance to standard methods, while retaining their interpretative simplicity. We illustrate using real banking data sets.
Similar resources
Graph-based Learning with Unbalanced Clusters
Graph construction is a crucial step in spectral clustering (SC) and graph-based semi-supervised learning (SSL). Spectral methods applied on standard graphs such as full-RBF, ε-graphs and k-NN graphs can lead to poor performance in the presence of proximal and unbalanced data. This is because spectral methods based on minimizing RatioCut or normalized cut on these graphs tend to put more import...
A NEW CLASS OF UNBALANCED HAAR WAVELETS THAT FORM AN UNCONDITIONAL BASIS FOR Lp ON GENERAL MEASURE SPACES
Given a complete separable σ-finite measure space (X, Σ, μ) and nested partitions of X, we construct unbalanced Haar-like wavelets on X that form an unconditional basis for Lp(X, Σ, μ) where 1 < p < ∞. Our construction and proofs build upon ideas of Burkholder and Mitrea. We show that if (X, Σ, μ) is not purely atomic, then the unconditional basis constant of our basis is (max(p, q) − 1). We derive a fast a...
A note on searching sorted unbalanced three-dimensional arrays
We examine the problem of searching sequentially for a desired real value (a key) within a sorted unbalanced three-dimensional finite real array. This classic problem can be viewed as determining the correct dimensional threshold function from a finite class of such functions within the array, based on sequential queries that take the form of point samples. This note addresses the challenge of ...
Model-Free Gene Selection Method by Considering Unbalanced Samples
In gene expression data analysis, discriminator genes are importantly informative genes for further research. Recently, a great deal of research has focused on the challenging task of identifying these informative genes from microarray data. However, the sizes of sample classes in microarray data are often unbalanced. The unbalance of samples has not been explicitly and correctly considered by ...